170 resultados para GENOMIC SEQUENCE

em Indian Institute of Science - Bangalore - Índia


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The genomic sequences of several RNA plant viruses including cucumber mosaic virus, brome mosaic virus, alfalfa mosaic virus and tobacco mosaic virus have become available recently. The former two viruses are icosahedral while the latter two are bullet and rod shaped, respectively in particle morphology. The non-structural 3a proteins of cucumber mosaic virus and brome mosaic virus have an amino acid sequence homology of 35% and hence are evolutionarily related. In contrast, the coat proteins exhibit little homology, although the circular dichroism spectrum of these viruses are similar. The non-coding regions of the genome also exhibit variable but extensive homology. Comparison of the brome mosaic virus and alfalfa mosaic virus sequences reveals that they are probably related although with a much larger evolutionary distance. The polypeptide folds of the coat protein of three biologically distinct isometric plant viruses, tomato bushy stunt virus, southern bean mosaic virus and satellite tobacco necrosis virus have been shown to display a striking resemblance. All of them consist of a topologically similar 8-standard β-barrel. The implications of these studies to the understanding of the evolution of plant viruses will be discussed.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The 3prime terminal 1255nt sequence of Physalis mottle virus (PhMV) genomic RNA has been determined from a set of overlapping cDNA clones. The open reading frame (ORF) at the 3prime terminus corresponds to the amino acid sequence of the coat protein (CP) determined earlier except for the absence of the dipeptide, Lys-Leu, at position 110-111. In addition, the sequence upstream of the CP gene contains the message coding for 178 amino acid residues of the C-terminus of the putative replicase protein (RP). The sequence downstream of the CP gene contains an untranslated region whose terminal 80 nucleotides can be folded into a characteristic tRNA-like structure. A phylogenetic tree constructed after aligning separately the sequence of the CP, the replicase protein (RP) and the tRNA-like structure determined in this study with the corresponding sequences of other tymoviruses shows that PhMV wrongly named belladonna mottle virus [BDMV(I)] is a separate tymovirus and not another strain of BDMV(E) as originally envisaged. The phylogenetic tree in all the three cases is identical showing that any subset of genomic sequence of sufficient length can be used for establishing evolutionary relationships among tymoviruses.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The 3' terminal 1255 nt sequence of Physalis mottle virus (PhMV) genomic RNA has been determined from a set of overlapping cDNA clones. The open reading frame (ORF) at the 3' terminus corresponds to the amino acid sequence of the coat protein (CP) determined earlier except for the absence of the dipeptide, Lys-Leu, at position 110-111. In addiition, the sequence upstream of the CP gene contains the message coding for 178 amino acid residues of the C-terminus of the putative replicase protein (RP). The sequence downstream of the CP gene contains an untranslated region whose terminal 80 nucleotides can be folded into a characteristic tRNA-like structure. A phylogenetic tree constructed after aligning separately the sequence of the CP, the replicase protein (RP) and the tRNA-like structure determined in this study with the corresponding sequences of other tymoviruses shows that PhMV wrongly named belladonna mottle virus [BDMV(I)] is a separate tymovirus and not another strain of BDMV(E) as originally envisaged. The phylogenetic tree in all the three cases is identical showing that any subset of genomic sequence of sufficient length can be used for establishing evolutionary relationships among tymoviruses.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. Promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence an in silico identification of promoters is crucial in order to guide experimental work and to pin point the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values, for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool `PromPredict'. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC) sensitivity of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC) a sensitivity of 100% and precision of 49% was obtained.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

X-ray diffraction studies on single crystals of a few viruses have led to the elucidation of their three dimensional structure at near atomic resolution. Both the tertiary structure of the coat protein subunit and the quaternary organization of the icosahedral capsid in these viruses are remarkably similar. These studies have led to a critical re-examination of the structural principles in the architecture of isometric viruses and suggestions of alternative mechanisms of assembly. Apart from their role in the assembly of the virus particle, the coat proteins of certian viruses have been shown to inhibit the replication of the cognate RNA leading to cross-protection. The coat protein amino acid sequence and the genomic sequence of several spherical plant RNA viruses have been determined in the last decade. Experimental data on the mechanisms of uncoating, gene expression and replication of several classes of viruses have also become available. The function of the non-structural proteins of some viruses have been determined. This rapid progress has provided a wealth of information on several key steps in the life cycle of RNA viruses. The function of the viral coat protein, capsid architecture, assembly and disassembly and replication of isometric RNA plant viruses are discussed in the light of this accumulated knowledge.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Sixteen million nucleotide sequence of genome of various organisms have been analysed to detect and study the extent of occurrence of simple repetitive sequences. Two sequence motifs (TG/CA)n and (CT/AG)n capable of adopting unusual DNA structures, left handed Z-conformation and triple-helical conformation respectively, are found to be abundant in rodent and human genomes, but almost completely absent in bacterial genome. (TG/CA)n and (CT/AG)n sequences are present mostly in the intron or 5'/3' flanking regions of the genes. The presence of such repeat motifs in genomic sequence of higher eukaryotes has been correlated with their possible functional significance in nucleosome organization, recombination and gene expression.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

X-ray diffraction studies on single crystals of a few viruses have led to the elucidation of their three dimensional structure at near atomic resolution. Both the tertiary structure of the coat protein subunit and the quaternary morganization of the icosahedral capsid in these viruses are remarkably similar. These studies have led to a critical re-examination of the structural principles in the architecture of isometric viruses and suggestions of alternative mechanisms of assembly. Apart from their role in the assembly of the virus particle, the coat proteins of certian viruses have been shown to inhibit the replication of the cognate RNA leading to cross-protection. The coat protein amino acid sequence and the genomic sequence of several spherical plant RNA viruses have been determined in the last decade. Experimental data on the mechanisms of uncoating, gene expression and replication of several classes of viruses have also become available. The function of the non-structural proteins of some viruses have been determined. This rapid progress has provided a wealth of information on several key steps in the life cycle of RNA viruses. The function of the viral coat protein, capsid architecture, assembly and disassembly and replication of isometric RNA plant viruses are discussed in the light of this accumulated knowledge.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Thiolases are important in fatty-acid degradation and biosynthetic pathways. Analysis of the genomic sequence of Mycobacterium smegmatis suggests the presence of several putative thiolase genes. One of these genes appears to code for an SCP-x protein. Human SCP-x consists of an N-terminal domain (referred to as SCP2 thiolase) and a C-terminal domain (referred as sterol carrier protein 2). Here, the cloning, expression, purification and crystallization of this putative SCP-x protein from M. smegmatis are reported. The crystals diffracted X-rays to 2.5 angstrom resolution and belonged to the triclinic space group P1. Calculation of rotation functions using X-ray diffraction data suggests that the protein is likely to possess a hexameric oligomerization with 32 symmetry which has not been observed in the other six known classes of this enzyme.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Physical clustering of genes has been shown in plants; however, little is known about gene clusters that have different functions, particularly those expressed in the tomato fruit. A class I 17.6 small heat shock protein (Sl17.6 shsp) gene was cloned and used as a probe to screen a tomato (Solanum lycopersicum) genomic library. An 8.3-kb genomic fragment was isolated and its DNA sequence determined. Analysis of the genomic fragment identified intronless open reading frames of three class I shsp genes (Sl17.6, Sl20.0, and Sl20.1), the Sl17.6 gene flanked by Sl20.1 and Sl20.0, with complete 5' and 3' UTRs. Upstream of the Sl20.0 shsp, and within the shsp gene cluster, resides a box C/D snoRNA cluster made of SlsnoR12.1 and SlU24a. Characteristic C and D, and C' and D', boxes are conserved in SlsnoR12.1 and SlU24a while the upstream flanking region of SlsnoR12.1 carries TATA box 1, homol-E and homol-D box-like cis sequences, TM6 promoter, and an uncharacterized tomato EST. Molecular phylogenetic analysis revealed that this particular arrangement of shsps is conserved in tomato genome but is distinct from other species. The intronless genomic sequence is decorated with cis elements previously shown to be responsive to cues from plant hormones, dehydration, cold, heat, and MYC/MYB and WRKY71 transcription factors. Chromosomal mapping localized the tomato genomic sequence on the short arm of chromosome 6 in the introgression line (IL) 6-3. Quantitative polymerase chain reaction analysis of gene cluster members revealed differential expression during ripening of tomato fruit, and relatively different abundances in other plant parts.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Despite more than 40 years of intense study, essential features of the silkmoth chorion (eggshell) are still not fully understood. To determine the precise structure of the chorion locus, we performed extensive EST analysis, constructed a bacterial artificial chromosome (BAC) contig, and obtained a continuous genomic sequence of 871,711 base pairs. We annotated 127 chorion genes in two segments interrupted by a 164 kb region with 5 non-chorion genes, orthologs of which were on chorion bearing scaffolds in 4 ditrysian families. Detailed transcriptome analysis revealed expression throughout choriogenesis of most chorion genes originally categorized as ``middle'', and evidence for diverse regulatory mechanisms including cis-elements, alternative splicing and promoter utilization, and antisense RNA. Phylogenetic analysis revealed multigene family associations and faster evolution of early chorion genes and transcriptionally active pseudogenes. Proteomics analysis identified 99 chorion proteins in the eggshell and micropyle localization of 1 early and 6 Hc chorion proteins.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In recent years, identification of sequence patterns has been given immense importance to understand better their significance with respect to genomic organization and evolutionary processes. To this end, an algorithm has been derived to identify all similar sequence repeats present in a protein sequence. The proposed algorithm is useful to correlate the three-dimensional structure of various similar sequence repeats available in the Protein Data Bank against the same sequence repeats present in other databases like SWISS-PROT, PIR and Genome databases.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Using the polymerase chain reaction, the coding sequence for peanut agglutinin (PNA) was cloned and expressed in Escherichia coli. Amplified PNA is identical to previously reported cDNA, suggesting the absence of any introns in PNA gene. Recombinant (re-) PNA forms inclusion bodies in E. coli. Production of PNA was confirmed by probing Western blots with polyclonal anti-PNA immunoglobulin G. Inclusion bodies were solubilized with 6 M guanidine-HCl and renatured by rapid dilution in the presence of metal ions. The renatured lectin was then purified by affinity chromatography. The re-lectin shows carbohydrate-binding properties similar to the natural PNA. This expression system provides a model for future mutagenesis studies of the carbohydrate-binding site and thus facilitates ongoing efforts to explore the molecular basis for the specificity of lectin-carbohydrate interaction.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background & objectives: Periplasmic copper and zinc superoxide dismutase (Cu,Zn-SOD or SodC) is an important component of the antioxidant shield which protects bacteria from the phagocytic oxidative burst. Cu,Zn-SODs protect Gram-negative bacteria against oxygen damage which have also been shown to contribute to the pathogenicity of these bacterial species. We report the presence of SodC in drug resistant Salmonella sp. isolated from patients suffering from enteric fever. Further sodC was amplified, cloned into Escherichia coli and the nucleotide sequence and amino acid sequence homology were compared with the standard strain Salmonella Typhimurium 14028. Methods: Salmonella enterica serovar Typhi (S. Typhi) and Salmonellaenterica serovar Paratyphi (S. Paratyphi) were isolated and identified from blood samples of the patients. The isolates were screened for the presence of Cu, Zn-SOD by PAGE using KCN as inhibitor of Cu,Zn-SOD. The gene (sodC) was amplified by PCR, cloned and sequenced. The nucleotide and amino acid sequences of sodC were compared using CLUSTAL X.Results: SodC was detected in 35 per cent of the Salmonella isolates. Amplification of the genomic DNA of S. Typhi and S. Paratyphi with sodC specific primers resulted in 519 and 515 bp amplicons respectively. Single mutational difference at position 489 was observed between thesodC of S. Typhi and S. Paratyphi while they differed at 6 positions with the sodC of S. Typhimurium 14028. The SodC amino acid sequences of the two isolates were homologous but 3 amino acid difference was observed with that of standard strain S. Typhimurium 14028.Interpretation & conclusions: The presence of SodC in pathogenic bacteria could be a novel candidate as phylogenetic marker.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The nucleotide sequence of cosmid B1790, carrying the Rif-Str regions of the Mycobacterium leprae chromosome, has been determined. Twelve open reading frames were identified in the 36716bp sequence, representing 40% of the coding capacity. Five ribosomal proteins, two elongation factors and the β and β'subunits of RNA polymerase have been characterized and two novel genes were found. One of these encodes a member of the so-called ABC family of ATP-binding proteins while the other appears to encode an enzyme involved in repairing genomic lesions caused by free radicals. This finding may well be significant as M. leprae, an intracellular pathogen, lives within macrophages.